skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Search for: All records

Creators/Authors contains: "Ting, Yuan-Sen"

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

  1. Abstract AstroSage-Llama-3.1-8B is a domain-specialized natural-language AI assistant tailored for research in astronomy, astrophysics, cosmology, and astronomical instrumentation. Trained on the complete collection of astronomy-related arXiv papers from 2007 to 2024 along with millions of synthetically-generated question-answer pairs and other astronomical literature, AstroSage-Llama-3.1-8B demonstrates remarkable proficiency on a wide range of questions. AstroSage-Llama-3.1-8B scores 80.9% on the AstroMLab-1 benchmark, greatly outperforming all models—proprietary and open-weight—in the 8-billion parameter class, and performing on par with GPT-4o. This achievement demonstrates the potential of domain specialization in AI, suggesting that focused training can yield capabilities exceeding those of much larger, general-purpose models. AstroSage-Llama-3.1-8B is freely available, enabling widespread access to advanced AI capabilities for astronomical education and research. 
    more » « less
    Free, publicly-accessible full text available December 1, 2026
  2. Neural network-based emulators for the inference of stellar parameters and elemental abundances represent an increasingly popular methodology in modern spectroscopic surveys. However, these approaches are often constrained by their emulation precision and domain transfer capabilities. Greater generalizability has previously been achieved only with significantly larger model architectures, as demonstrated by Transformer-based models in natural language processing. This observation aligns with neural scaling laws, where model performance predictably improves with increased model size, computational resources allocated to model training, and training data volume. In this study, we demonstrate that these scaling laws also apply to Transformer-based spectral emulators in astronomy. Building upon our previous work with TransformerPayne and incorporating Maximum Update Parametrization techniques from natural language models, we provide training guidelines for scaling models to achieve optimal performance. Our results show that within the explored parameter space, clear scaling relationships emerge. These findings suggest that optimal computational resource allocation requires balanced scaling. Specifically, given a tenfold increase in training compute, achieving an optimal seven-fold reduction in mean squared error necessitates an approximately 2.5-fold increase in dataset size and a 3.8-fold increase in model size. This study establishes a foundation for developing spectral foundational models with enhanced domain transfer capabilities. 
    more » « less
    Free, publicly-accessible full text available January 1, 2026
  3. Dwarf galaxy star formation histories are theoretically expected to be bursty, potentially leaving distinct imprints on their chemical evolution. We propose that episodic starbursts with quiescent periods longer than ~100 Myr should lead to discontinuous tracks in a dwarf galaxy’s [ α /Fe]-[Fe/H] chemical abundance plane, with metallicity gaps as large as 0.3-0.5 dex at [Fe/H] = -2. This occurs due to continued Fe production by Type Ia supernovae during quiescent periods. We demonstrate that Gaussian mixture models can statistically distinguish discontinuous and continuous tracks based on the Akaike Information Criterion. Applying this method to APOGEE observations of the Sculptor dSph galaxy suggests an episodic star formation history with ~300 Myr quiescent periods. While current dwarf galaxy datasets are limited by small spectroscopic sample sizes, future surveys and extremely large telescopes will enable determining large numbers of precise chemical abundances, opening up the investigation of very short timescales in early dwarf galaxy formation. This unprecedentedly high time resolution of dwarf galaxy formation in the early Universe has important implications for understanding both reionization in the early Universe and the episodic star formation cycle of dwarf galaxies. 
    more » « less
    Free, publicly-accessible full text available January 1, 2026
  4. Free, publicly-accessible full text available May 23, 2026
  5. Abstract We present a proof-of-concept simulation-based inference on Ωmandσ8from the Sloan Digital Sky Survey (SDSS) Baryon Oscillation Spectroscopic Survey (BOSS) LOWZ Northern Galactic Cap (NGC) catalog using neural networks and domain generalization techniques without the need of summary statistics. Using rapid light-cone simulations L-picola, mock galaxy catalogs are produced that fully incorporate the observational effects. The collection of galaxies is fed as input to a point cloud-based network,Minkowski-PointNet. We also add relatively more accurate Gadgetmocks to obtain robust and generalizable neural networks. By explicitly learning the representations that reduce the discrepancies between the two different data sets via the semantic alignment loss term, we show that the latent space configuration aligns into a single plane in which the two cosmological parameters form clear axes. Consequently, during inference, the SDSS BOSS LOWZ NGC catalog maps onto the plane, demonstrating effective generalization and improving prediction accuracy compared to non-generalized models. Results from the ensemble of 25 independently trained machines find Ωm= 0.339 ± 0.056 andσ8= 0.801 ± 0.061, inferred only from the distribution of galaxies in the light-cone slices without relying on any indirect summary statistics. A single machine that best adapts to the Gadgetmocks yields a tighter prediction of Ωm= 0.282 ± 0.014 andσ8= 0.786 ± 0.036. We emphasize that adaptation across multiple domains can enhance the robustness of the neural networks in observational data. 
    more » « less
  6. ABSTRACT Chemical abundance anomalies in twin stars have recently been considered tell-tale signs of interactions between stars and planets. While such signals are prevalent, their nature remains a subject of debate. On the one hand, exoplanet formation may induce chemical depletion in host stars by locking up refractory elements. On the other hand, exoplanet engulfment can result in chemical enrichment, and both processes potentially produce similar differential signals. In this study, we aim to observationally disentangle these processes by using the Ca ii infrared triplet to measure the magnetic activity of 125 co-moving star pairs with high signal-to-noise ratio, and high-resolution spectra from the Magellan, Keck, and VLT (Very Large Telescope) telescopes. We find that co-natal star pairs in which the two stars exhibit significant chemical abundance differences also show differences in their magnetic activity, with stars depleted in refractories being magnetically more active. Furthermore, the strength of this correlation between differential chemical abundances and differential magnetic activity increases with condensation temperature. One possible explanation is that the chemical anomaly signature may be linked to planet formation, wherein refractory elements are locked into planets, and the host stars become more active due to more efficient contraction during the pre-main-sequence phase or star–planet tidal and magnetic interactions. 
    more » « less
  7. ABSTRACT We present the first detailed chemical-abundance analysis of stars from the dwarf-galaxy stellar stream Wukong/LMS-1 covering a wide metallicity range ($$-3.5 \lt \rm [Fe/H] \lesssim -1.3$$). We find abundance patterns that are effectively indistinguishable from the bulk of Indus and Jhelum, a pair of smaller stellar streams proposed to be dynamically associated with Wukong/LMS-1. We confirmed a carbon-enhanced metal-poor star ($$\rm [C/Fe] \gt +0.7$$ and $$\rm [Fe/H] \sim -2.9$$) in Wukong/LMS-1 with strong enhancements in Sr, Y, and Zr, which is peculiar given its solar-level [Ba/Fe]. Wukong/LMS-1 stars have high abundances of α elements up to $$\rm [Fe/H] \gtrsim -2$$, which is expected for relatively massive dwarfs. Towards the high-metallicity end, Wukong/LMS-1 becomes α-poor, revealing that it probably experienced fairly standard chemical evolution. We identified a pair of N- and Na-rich stars in Wukong/LMS-1, reminiscent of multiple stellar populations in globular clusters. This indicates that this dwarf galaxy contained at least one globular cluster that was completely disrupted in addition to two intact ones previously known to be associated with Wukong/LMS-1, which is possibly connected to similar evidence found in Indus. From these ≥3 globular clusters, we estimate the total mass of Wukong/LMS-1 to be $${\approx }10^{10} \, \mathrm{M}_\odot$$, representing ∼1 per cent of the present-day Milky Way. Finally, the [Eu/Mg] ratio in Wukong/LMS-1 continuously increases with metallicity, making this the first example of a dwarf galaxy where the production of r-process elements is clearly dominated by delayed sources, presumably neutron-star mergers. 
    more » « less
  8. ABSTRACT Carbon-enhanced metal-poor (CEMP) stars comprise almost a third of stars with [Fe/H] < −2, although their origins are still poorly understood. It is highly likely that one sub-class (CEMP-s stars) is tied to mass-transfer events in binary stars, while another sub-class (CEMP-no stars) are enriched by the nucleosynthetic yields of the first generations of stars. Previous studies of CEMP stars have primarily concentrated on the Galactic halo, but more recently they have also been detected in the thick disc and bulge components of the Milky Way. Gaia DR3 has provided an unprecedented sample of over 200 million low-resolution (R ≈ 50) spectra from the BP and RP photometers. Training on the CEMP catalogue from the SDSS/SEGUE database, we use XGBoost to identify the largest all-sky sample of CEMP candidate stars to date. In total, we find 58 872 CEMP star candidates, with an estimated contamination rate of 12 per cent. When comparing to literature high-resolution catalogues, we positively identify 60–68 per cent of the CEMP stars in the data, validating our results and indicating a high completeness rate. Our final catalogue of CEMP candidates spans from the inner to outer Milky Way, with distances as close as r ∼ 0.8 kpc from the Galactic centre, and as far as r > 30 kpc. Future higher resolution spectroscopic follow-up of these candidates will provide validations of their classification and enable investigations of the frequency of CEMP-s and CEMP-no stars throughout the Galaxy, to further constrain the nature of their progenitors. 
    more » « less
  9. ABSTRACT We model the stellar abundances and ages of two disrupted dwarf galaxies in the Milky Way stellar halo: Gaia-Sausage Enceladus (GSE) and Wukong/LMS-1. Using a statistically robust likelihood function, we fit one-zone models of galactic chemical evolution with exponential infall histories to both systems, deriving e-folding time-scales of τin = 1.01 ± 0.13 Gyr for GSE and $$\tau _\text{in} = 3.08^{+3.19}_{-1.16}$$ Gyr for Wukong/LMS-1. GSE formed stars for $$\tau _\text{tot} = 5.40^{+0.32}_{-0.31}$$ Gyr, sustaining star formation for ∼1.5–2 Gyr after its first infall into the Milky Way ∼10 Gyr ago. Our fit suggests that star formation lasted for $$\tau _\text{tot} = 3.36^{+0.55}_{-0.47}$$ Gyr in Wukong/LMS-1, though our sample does not contain any age measurements. The differences in evolutionary parameters between the two are qualitatively consistent with trends with stellar mass M⋆ predicted by simulations and semi-analytic models of galaxy formation. Our inferred values of the outflow mass-loading factor reasonably match $$\eta \propto M_\star ^{-1/3}$$ as predicted by galactic wind models. Our fitting method is based only on Poisson sampling from an evolutionary track and requires no binning of the data. We demonstrate its accuracy by testing against mock data, showing that it accurately recovers the input model across a broad range of sample sizes (20 ≤ N ≤ 2000) and measurement uncertainties (0.01 ≤ σ[α/Fe], σ[Fe/H] ≤ 0.5; $$0.02 \le \sigma _{\log _{10}(\text{age})} \le 1$$). Due to the generic nature of our derivation, this likelihood function should be applicable to one-zone models of any parametrization and easily extensible to other astrophysical models which predict tracks in some observed space. 
    more » « less
  10. Abstract Some studies of stars’ multielement abundance distributions suggest at least 5–7 significant dimensions, but others show that many elemental abundances can be predicted to high accuracy from [Fe/H] and [Mg/Fe] (or [Fe/H] and age) alone. We show that both propositions can be, and are, simultaneously true. We adopt a machine-learning technique known as normalizing flow to reconstruct the probability distribution of Milky Way disk stars in the space of 15 elemental abundances measured by APOGEE. Conditioning on T eff and log g minimizes the differential systematics. After further conditioning on [Fe/H] and [Mg/Fe], the residual scatter for most abundances is σ [ X /H] ≲ 0.02 dex, consistent with APOGEE’s reported statistical uncertainties of ∼0.01–0.015 dex and intrinsic scatter of 0.01–0.02 dex. Despite the small scatter, residual abundances display clear correlations between elements, which we show are too large to be explained by measurement uncertainties or by the finite sampling noise. We must condition on at least seven elements to reduce the correlations to a level consistent with the observational uncertainties. Our results demonstrate that cross-element correlations are a much more sensitive probe of a hidden structure than dispersion, and they can be measured precisely in a large sample even if the star-by-star measurement noise is comparable to the intrinsic scatter. We conclude that many elements have an independent story to tell, even for the mundane disk stars and elements produced by the core-collapse and Type Ia supernovae. The only way to learn these lessons is to measure the abundances directly, and not merely infer them. 
    more » « less